A fuzzy search model for dealing with retrieval issues in some classes of dirty data

Authors

  • Olufade F. W. Onifade
  • Oladeji P. Akomolafe
Abstract

Poor data quality management carries inherent risks of capital loss and heightened exposure. Existing efforts, such as treating data as products, capturing metadata to manage data quality, statistical techniques, source calculus and algebra, data stewardship, and dimensional gap analysis, have all failed to incorporate the contextual factors, which are fuzzy in nature. The conventional manner of using information requires discrete values that are precise and devoid of ambiguity. However, this is not realizable, because human beings describe situations with imprecise expressions that carry a high level of uncertainty or have no clear boundaries, e.g. "I am very hungry" or "it is going to be cloudy today". The bulk of the challenges posed by dirty data can be seen to stem from "not missing, but wrong" data. Such data result from differing data across databases, ambiguous data, the use of abbreviations or incomplete text, and non-standard data, which covers different representations of compound data. This research employs a fuzzy model to facilitate retrieval despite this myriad of dirty data problems.
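
As a rough illustration of the idea of graded (rather than exact) matching, the following minimal Python sketch scores each stored value by a degree of match in [0, 1] and keeps those above a cut-off. It is not the authors' model, which the abstract does not specify: the function names `membership` and `fuzzy_search`, the threshold `alpha`, and the use of the standard library's `difflib.SequenceMatcher` similarity ratio as a stand-in membership function are all assumptions made for this example.

```python
from difflib import SequenceMatcher


def membership(query: str, value: str) -> float:
    """Degree in [0, 1] to which `value` matches `query`.

    Assumption: a character-level similarity ratio stands in for a
    fuzzy membership function; case and surrounding spaces are ignored.
    """
    a = query.lower().strip()
    b = value.lower().strip()
    return SequenceMatcher(None, a, b).ratio()


def fuzzy_search(query: str, records: list[str], alpha: float = 0.6):
    """Return (score, record) pairs whose score is at least `alpha`,
    best match first. `alpha` is a hypothetical cut-off threshold."""
    scored = [(membership(query, r), r) for r in records]
    hits = [(s, r) for s, r in scored if s >= alpha]
    return sorted(hits, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Dirty variants of one entity: abbreviation, incomplete text, and
    # an unrelated value that an exact-match query would treat the same.
    records = ["University of Ibadan", "Univ. of Ibadan", "Univ Ibadan", "Lagos"]
    for score, record in fuzzy_search("University of Ibadan", records, alpha=0.5):
        print(f"{score:.2f}  {record}")
```

An exact-match query would return only the first record; with a graded score, abbreviated or incomplete variants can still be retrieved and ranked, and the threshold controls how much imprecision is tolerated.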

Similar resources

Proceedings of the 16th International Conference on Information Quality, ICIQ 2011, Adelaide, Australia, November 18-20, 2011

s for Keynotes; A Practitioner's View of the Really Big Data Quality Research Issues; What Does the Next Generation of Business Models Mean for Information Quality?; Cloud Computing and Data Quality Services; Employing ISO9001 to Improve Water Information Quality in New South Wales; Data Quality in Shell: Building IQ Knowledge and Skills; A Journey Towards Enhanced Data Quality in Heal...

Fuzzy retrieval of encrypted data by multi-purpose data-structures

The growing amount of information arising from emerging technologies has left organizations facing challenges in maintaining and managing their information. Expanding hardware and human resources, and outsourcing data management and maintenance to an external organization in the form of cloud storage services, are two common approaches to overcoming these challenges. The first approach costs of...

Private Key based query on encrypted data

Nowadays, users of information systems tend to use a central server to reduce data transfer and maintenance costs. Since such a system is not fully trustworthy, users' data is usually kept encrypted. However, encryption is not a panacea for security problems and cannot guarantee data security. In other words, there are some techniques that can endanger the security of encrypted d...

Fuzzy efficiency: Multiplier and enveloping CCR models

The performance of a set of activities or organizations in an uncertain environment has been compared by means of Fuzzy Data Envelopment Analysis (FDEA), since traditional DEA models require accurate and precise performance data. As a way of dealing with uncertain environments, many researchers have introduced DEA models in a fuzzy environment. Some of these models ar...

Factors Affecting Student's Scientific Information Retrieval based on Fuzzy Logic Method Compared to Traditional Method

Background and aim: The aim of this study was to identify the factors affecting students' performance in information retrieval based on the fuzzy logic method compared to the traditional method. Materials and methods: This descriptive survey study was performed using a quantitative approach. The research population was 34 PhD students, and a researcher-made questionnaire was used. Data were analyzed...

Publication date: 2011